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Abstract. We prove the main conjecture of the paper "On the 
expected value of the minimum assignment" by Marshall W. Buck, 
Clara S. Chan, and David P. Robbins (Random Structures & Al- 
gorithms 21 (2002), no. 1, 33-58). This is a vast generalization 
of a formula conjectured by Giorgio Parisi for the n by n random 
assignment problem. 



1. The Parisi formula 

This work is motivated by a conjecture made in 1998 by the physicist 
Giorgio Parisi |P98j . Consider an n by n matrix of independent exp(l) 
random variables. Parisi conjectured that the expected value of the 
minimal sum of n matrix entries of which no two belong to the same 
row or column, is given by the formula 



n 
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i=l 

An equivalent setting is obtained by considering the expected minimum 
cost of a perfect matching in a complete n by n bipartite graph with 
independent exp(l) edge costs. 

The problem had already received quite some attention. At the time, 
the main open question was the value of the limit of the expected op- 
timal value as n tends to infinity. A non-rigorous argument due to 
Marc Mezard and Parisi |MP85j showed that the limit ought to be 
C(2) = 7r 2 /6. David Aldous |A92j proved, using an infinite model, that 
the limit exists. The striking conjecture of Parisi (obviously consistent 
with the conjectured C(2)-limit) paved the way for an entirely new ap- 
proach. It seemed likely that ^ would yield to an inductive argument, 
and that therefore the £(2)-limit could be established by exact analy- 
sis of the "finite n" case. The Parisi formula was almost immediately 
generalized by Don Coppersmith and Gregory B. Sorkin [HB98J to 
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for /^-assignments in an m by n matrix, k < min(m, n). It is not hard 
to verify that (J2J) specializes to (JIJ) when k = m = n. 

The Coppersmith-Sorkin conjecture was then generalized in two dif- 
ferent directions by Marshall W. Buck, Clara S. Chan, and David P. 
Robbins [BCR02 and by the present authors LWQH]- In a recent paper 
|LW03| . we prove our own conjecture, thereby establishing the Parisi 
and Coppersmith-Sorkin formulas. Remarkably, a proof of these formu- 
las was announced simultaneously by Chandra Nair, Balaji Prabhakar 
and Mayank Sharma [NPS03] . 

In this paper we state and prove a simultaneous generalization of the 
Buck-Chan-Robbins conjecture and the main theorem of LW03 . Not 
only is this a stronger result than that of |LW03j . but it also provides 
a considerable improvement of the proof. By combining the approach 
of |BCR02| with that of |LW03| . we obtain a proof of Parisi's formula 
which is shorter, simpler, and gives a far better insight into the problem 
than our original proof. 



2. The Buck-Chan-Robbins formula 

In this section, we state the formula conjectured by Buck, Chan and 
Robbins in jBCK.02| . This is a "combinatorial" formula, involving a 
binomial coefficient. As is shown in [BCR02 , there is an equivalent 
"probabilistic" version of the formula, and one can pass between the 
two via elementary properties of the Mobius function. In this paper we 
work in the probabilistic setting. In order to prove our main theorem, 
Theorem I9.1[ it is therefore not necessary to take the detour through 
the combinatorial formulas. The purpose of this section is therefore 
mainly to introduce some notation and provide the background. We 
remark, however, that both in [LWOOJ and |BCR02| . the discovery of 
the probabilistic formulas was made through formal manipulation of 
the combinatorial formulas. Therefore the latter have played an im- 
portant role in obtaining the results of this paper. 

Let M be an m by n matrix of nonnegative real numbers. The rows 
and columns of the matrix will be indexed by weighted sets R and C 
respectively. We may take R = {1, . . . , m} and C = {1, . . . , n}, but the 
sets R and C come with weight functions wr and wc respectively that 
associate a positive weight to each element of the set. A k- assignment 
is a set 7r C R x C of k matrix positions, or sites, of which no two 
belong to the same row or column. An assignment will also be called 
an independent set. The cost of tt is the sum 

cost A f(7r) = ^ M (hJ) 

(i,j)eir 

of the matrix entries in tt. We let minfc(M) denote the minimum cost 
of all fc-assignments in M. 
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In [BCR02J, the Parisi and Coppersmith-Sorkin conjectures are gen- 
eralized to a certain type of matrix with entries which are exponential 
random variables, but not necessarily with parameter 1. We say that a 
random variable x is exponential of rate a if Pr(x > t) = e~~ at for every 
t > 0. In this case we write x ~ exp(a). Buck, Chan and Robbins con- 
sidered the following type of matrix: For every G R x C, M(i,j) 
is exp(wR(i)w(7(j))-distributed, and independent of all other matrix 
entries. To state the formula, we use the following notation: If X is a 
set of rows, we let Wr{X) = ^2 ie x w R(i)i an d wr(X) = Wr(R\X). We 
use similar notation for sets of columns. 

Theorem 2.1 (Conjectured by Buck, Chan and Robbins 2000). Let 

M be a matrix as described above. Then 

'm + n-l-\X\- \Y\\ (_i)fc-H*H>1 



(3) E[min fc (M)] = ^ 



x - /; v k-l-\X\-\Y\ J w R (X)w c (Y) 

YCC 

Notice that in order for the binomial coefficient to be nonzero, we 
must have \X\ + \Y\ < k, which ressembles the condition i + j < k 
in the Coppersmith-Sorkin formula (J2J). It is still not entirely obvious 
that (jHJ) specializes to the Coppersmith-Sorkin formula when the row- 
and column weights are set to 1. However, in BCR02J, the formula (jSJ) 
is shown to be equivalent to a formula given by an urn model. We will 
generalize this to a setting where a certain set of matrix entries are set 
to zero. 

3. Main Theorem 



The main theorem of |LW03j is a formula for the expected value of 



the minimal /c-assignment in a matrix where a specified set of entries are 
set to zero, and the remaining entries are independent exp(l)-variables. 
In this article we prove a formula for the common generalization of the 
matrices considered in IBCR02| and in (LW03j . We say that M is 
a standard matrix if the entries in a certain set Z of sites are zero, 
and the remaining entries are independent and distributed according 
to the row- and column weights, that is, M(i,j) ~ exp(wn(i)wc(j))- 
This is an obvious generalization of the concept of standard matrix in 
LW03| . As in |LW001 ILW031 IBCR02| we give two seemingly different 



but equivalent formulations of our main theorem. 

Let Z C R x C be a set of sites. A file is a row or a column. Let A 
be a set of files. We say that A is a cover of Z if every site in Z lies in 
a file that belongs to A. By a cover of the matrix M we mean a cover 
of the set of zeros of M. By a k — 1- cover we mean a cover consisting 
of k — 1 files. Finally by a partial k — 1-cover we mean a subset of a 
k — 1-cover. 

Let Jfc(M) be the set of partial k — 1-covers of the zeros of M. 
Let Jfc(M) denote the poset consisting of Jjfc(M) ordered by inclusion, 
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together with an artificial largest element 1. The k — 1-covers are 
coatoms in Jk(M). Let \i denote the Mobius function on intervals in 
Jfc(M) (see e.g. [S| for the basics of Mobius functions). 

The following is a combinatorial formulation of our main theorem. 

Theorem 3.1 (Main Theorem, combinatorial version). Let M be a 
standard matrix. Then 

(4) E[mMM)]= £ ~ J W% - 

If there are no zero entries in M, Jfe(M) consists of all sets of at most 
k — 1 files. The poset Jfc(M) is a truncated Boolean lattice obtained by 
deleting all elements of rank > k except the top element in the Boolean 
lattice B m+n . The fact that (jlj) specializes to (J3J) follows from the fact 
that the Mobius function of the truncated Boolean lattice occurring in 
is given by the signed binomial coefficient in (JHJ). 

Example 3.2. Let M be a standard 2x2 matrix with no zeros, and 
let the row- and column weights be wn(i) = ai, and wc(j) = bj. With 
k = 2, the poset J2(M) consists of six elements: The bottom element 
is the empty set. There are four elements of rank 1 consisting of one 
file, and then there is the top element 1. The Mobius function on the 
interval (0, 1) is equal to 3, and the Mobius function on the intervals 
from the rank 1 elements to the top element is —1. 



i 




Hence according to (jlj 

(5) E[min 2 (M)] = 

-3 1111 
(ai + a 2 )(bi + b 2 ) a 2 (b 1 + b 2 ) 01(61 + 62) 6 2 (ai + a 2 ) 6i(di + a 2 )' 

If we set the weights equal to 1, (0) specializes to 5/4, in accordance 
with the Parisi formula 1 + 1/4. If we compare to 0, we see that the 
numerators in (JHJ are indeed equal to the binomial coefficients in (JHJ) 
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4. Matrix reduction 

Polynomial time algorithms for computing min^M) for a given (non- 
random) matrix M are well-known. We do not focus here on issues of 
computational efficiency, but we outline an algorithm whose special 
features will be of importance. The following lemma expresses a fun- 
damental property of optimal assignments. It is proved in [BCR02 , 
although these authors make no claims of originality. The first state- 
ment is certainly well-known, but we haven't been able to trace the 
second statement to any other source than BCR02J. For complete- 
ness, we include an outline of the proof. 

Lemma 4.1 (Nesting Lemma). Let Mbe a real m by n matrix, and 
let ki < &2 < min(m, n) be positive integers. If fi is an optimal k\- 
assignment in M , then there is an optimal k2-assignment \J in M such 
that every file that intersects fi also intersects p! . Moreover, if v is an 
optimal ki- assignment, then there is an optimal k\-assignment v' such 
that every file that intersects v' intersects v. 

Sketch of proof. We may assume that k\ = k 2 — l. Suppose that fi is an 
optimal fci-assignment, and v is an optimal /^-assignment. Consider 
the symmetric difference 5 = fiAv. We say that two sites are adjacent if 
they are in the same row or column. The components of 5 with respect 
to adjacency are cycles or paths. Suppose that 5' is a subset of 6 which 
consists of a number of entire components of 5, in other words such that 
no site in 5' is adjacent to a site in 5\5' '. Suppose moreover that 5' is 
balanced in the sense that it contains equally many sites from fi and v. 
Then fiAb' is a /^-assignment, and uA5' is a /^-assignment. It follows 
that cost(/i fl 5') = cost(f PI 5'). Hence in the components of S that 
are balanced, the cost of the sites in /z is equal to the cost of the sites 
in v, and for all the other components, the difference in cost between 
the sites in fi and the sites in v is the same (with a sign depending on 
which one of /i and v is overrepresented) . As a consequence, we may 
take a single component 5i of 5 so that 5i has one more site in v than 
in fi. Then fi' = fiA5\ is an optimal ^-assignment, and v' = vA8\ 
is an optimal /^-assignment, and it is straightforward to verify that fi 1 
and v' have the desired properties. □ 

Let Z C R x C be a set of sites. We say that a cover of Z is optimal if 
it has the minimum number of files among all covers of Z. The rank of 
a set of sites is the size of the largest independent subset. The following 
is a famous theorem due to Denes Konig: 

Theorem 4.2 (Konig's theorem). The number of files in an optimal 
cover of Z is equal to rank(Z). 

The following theorem forms the basis of an algorithm for computing 
minfc(M): 
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Theorem 4.3. Let M be a nonnegative m by n matrix, and let X be an 
optimal cover of M. Suppose that there is no zero cost k+1- assignment 
in M. Then every file in X intersects every optimal k-assignment in 
M. 

Proof. Let /i be an optimal fc-assignment. Since there is no zero cost 
k + 1- assignment, k > |A|. By Lemma l4.1| there is an optimal |A|- 
assignment \J such that /i intersects every file that intersects ^ '. By 
Konig's theorem, there is a zero cost | A [-assignment, and since \J is op- 
timal, this means that // has zero cost. Hence every file in A intersects 
fi'. The statement follows. □ 

The following matrix operation is fundamental for the algorithm. We 
refer to it as matrix reduction. Let M be a nonnegative m by n matrix, 
and let A = X U Y be an optimal cover of M, where X is the set of 
rows and Y is the set of columns in A. The reduction M' of M by A 
is obtained from M as follows: Let t be the minimum matrix entry of 
M which is not covered by A. If the site (i,j) is not covered by A, we 
let M'(i,j) = M(i,j) — t. In particular, this means that M' will have 
a zero entry not covered by A. If the site (i, j) is doubly covered by A, 
that is, i G X and j G Y, then we let M'(i,j) = M(i,j) + 1. Finally 
if (i, j) is covered by exactly one file in A, we let M'(i,j) = M(i,j). 
Notice that the entries of M' are nonnegative. 

Lemma 4.4. Let M' be the reduction of M by the optimal cover X. A 
k-assignment which is optimal in M is also optimal in M 1 . 

Proof. Let t be the minimum of the entries in M that are not covered 
by A. For s < t, let M s be the matrix obtained from M by subtracting 
s from the non-covered entries and adding s to the doubly covered 
entries. Since M s has no zero entries except those of M, it follows from 
Theorem 14. HI that every optimal /c-assignment in M s must intersect 
every file of A. By continuity, it follows that there is some optimal k- 
assignment in M' that intersects every file in A. All ^-assignments that 
intersect every file of A are affected in the same way by the reduction 
from M to M', namely if [i is such a fc-assignment, then cost At (M / ) = 
cost M (M) — [k — \X\)t. Hence if [i is an optimal /c-assignment in M, 
then [i is optimal also in M' . □ 

From Lemma f4. II and Konig's theorem we can deduce the following: 

Lemma 4.5. There is an optimal cover of Z containing every row that 
belongs to some optimal cover of Z , and similarly there is an optimal 
cover that contains every column that belongs to some optimal cover. 

These covers are called the row-maximal and the column-maximal 
optimal covers, respectively. 

Proof. It follows immediately from Konig's theorem that a file belongs 
to an optimal cover of Z if and only if it intersects every maximal 



THE BUCK-CHAN- ROBBINS CONJECTURE 



7 



independent subset of Z. Let A be the set of rows that belong to some 
optimal cover of Z. Let Z\X be the set of sites in Z that are not covered 
by A. Let /x be a maximal independent subset of Z\X. Then by Lemma 
I4.1l there is a maximal independent subset // of Z which intersects every 
row that intersects fi. At the same time, // must intersect every row 
in A. Therefore rank(Z\A) = rank(/x) = rank(Z) — |A|. Hence A can 
be extended to an optimal cover of Z. □ 

We want to be able to do induction over matrix reduction. Therefore 
we need the following lemma: 

Lemma 4.6. Let M = Mq be a nonnegative m by n matrix, and let 
k < min(m, n). For i > 0, let Mj+i be the reduction of Mi by the 
column-maximal optimal cover of Mi. Then one of the matrices Mi 
has a zero cost k-assignment. 

Proof. Let Zi be the set of sites where Mj has zeros. Let \ be the 
column-maximal optimal cover of Zj. By Konig's theorem, Zq has an 
independent subset \i containing exactly one site in each file in A . 
Hence /i contains no site which is doubly covered by Ao- It follows 
that rank(Zi) > rank(Z ). Suppose that rank(Zi) = rank(Z ). Every 
column in Ai must belong to A . Consequently every row in A must 
belong to Ai. Now since M\ has a zero which is not covered by Ao, 
there has to be a row in Ai which is not in Ao- 

To sum up, in each step of the reduction process, either the rank of 
the set of zeros increases, or the number of rows in the column-maximal 
optimal cover increases. □ 

A feature of matrix reduction that has been exploited in several pa- 
pers [LWOOt IAS02j is that it keeps track of the cost of the optimal 
k- assignment. In fact, if t is as above and M reduces to M', then 
minfc(M) = {k — |A|) • t + minfc(M'). This means that we can compute 
minfc(M) recursively by iterating the reduction and keeping track of 
the values of t as well as the sizes of the optimal covers that are used. 
As long as the matrix entries are independent exponential variables, it 
is easy to compute the expected value of the minimum t, even for gen- 
eral m and n. However, since the doubly covered entries will eventually 
consist of sums of several dependent random variables, it becomes ex- 
tremely hard to reach any conclusions valid for general k through this 
approach. 

One of the key insights that led to the proof of the Parisi formula 
in |LW03j was the fact that information about the probability that a 
certain matrix element participates in the optimal assignment will give 
information about the expected minimum cost. However, a problem 
with the reduction algorithm is that in general, it loses track of the 
location of the optimal assignment. 



8 



SVANTE LINUSSON AND JOHAN WASTLUND 



Example 4.7. Here k = 2, and after the final step, the matrix con- 
tains two zero-cost 2-assignments, of which only one was optimal in the 
original matrix. 

A 2 3\ /0 1 2\ {columnl} /0 1\ {rowl} A) l\ 

\3 4 3) \2 3 2j ' \2 2 l) * \1 1 OJ 

The approach taken in this paper builds on an observation that has 
largely been overlooked, even in [LW03J, namely that when the column- 
maximal optimal cover is used, matrix reduction keeps track of the set 
of rows that intersect the optimal fc-assignment. 

5. The participation probability lemma 

In this section we prove a slightly refined version of a lemma which 
first occurred in [LWOOJ. This lemma describes the probability that a 
certain exponential variable participates in the optimal solution to a 
random assignment problem. 

Lemma 5.1 ( LWOQ]). Let M be a random matrix where a particular 
entry M(i,j) ~ exp(ct) is independent of the other matrix entries. Let 
M' be as M except that M'(i,j) = 0. Then the probability that (i,j) 
belongs to the optimal k-assignment in M is 

a ■ (E[min fc (M)] - E[min fe (M')]) . 

Proof. We condition on all entries in M except M(i,j). Let M t be the 
deterministic matrix obtained by also conditioning on M(i,j) = t. Let 
f(t) = minfc(M t ). Then either / is constant, or / increases linearly 
up to a certain point after which it is constant. The key observation 
is that the site belongs to the optimal /c-assignment in M t if and 
only if f'{t) = 1 (disregarding the possibility that t is equal to the 
point where / is not differentiable) . Therefore if x ~ exp(a), then the 
probability that belongs to the optimal ^-assignment in M x is 

equal to E[/'(x)]. By partial integration we have 

POO 

(6) E[f'(x)} = a / e~ at f(t)dt = 
Jo 

poo POO 

a d (e- at f(t)) +a 2 e~ at f{t)dt 
Jo Jo 

= -af(0) + aE[f(x)} = a ■ (E[min fc (M)] - E[min fc (M')]) . 

□ 

6. The Buck-Chan-Robbins urn model 

The following urn model is described in BCR02J: An urn contains a 
set of balls, each with a given positive weight. Balls are drawn one at 
a time without replacement, and each time the probability of drawing 
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a particular ball is proportional to the weight of the ball. This simple 
model has perhaps been studied before, but the connection to random 
assignment problems is due to Buck, Chan and Robbins. 

To each weighted set we can associate such an urn process. Here we 
take as our weighted set the set R of row indices (in order not to make 
any secret of the kind of application we have in mind). We consider a 
continuous time version of this process. Each ball (row) i remains in 
the urn for an amount of time which is exp(u>#(i))-distributed, and the 
times at which the balls leave the urn are all independent. 

The urn process is described by a continuous time random walk 
u R : R + — > 2 R on the power set of R. For t > 0, u R {t) is the set of 
balls that have been drawn at time t. 

If X C R, we denote by Pr R (X) the probability that this random 
walk reaches X, that is, the probability that every ball in X is drawn 
before every ball not in X. 

Example 6.1. If there are three balls labeled 1, 2, 3 then 
Pr„(t)) = Pr R ({l, 2,3}) = 1, 

w R ({l,2,3}) 

and 

w R (l)w R (2) w R (l)w R (2) 



Pr fl ({l,2}) 



w R ({l, 2, 3} )w R ({2, 3}) w R ({l, 2, 3})w({l, 3}) ' 

since the set {1, 2} can be obtained either by first choosing 1 and then 
2, or the other way around. 

By an order ideal (or just ideal for short) we mean a family of sets 
of balls which is closed under taking subsets. 

Lemma 6.2. Suppose that I is an order ideal and further that G / 
and R £ I. Then 

(7) w R (i)PrR(X) = 

Xu{i}(£I 

Proof. The random walk u R starts in which is in J, and ends in R 
which is not in I. Since I is an order ideal, there will be exactly one 
step of the walk which leads from a set in / to a set not in /. The left 
hand side of (J7J) sums the probabilities of leaving I via a certain step, 
taken over all possible ways of leaving /. □ 

If I is an ideal, and R £ J, then we let exit R (I) be the random subset 
of R which is the first set in the urn process which does not belong to 
I. For i G R, we denote by I\i the ideal consisting of all sets in I which 
do not contain i. 
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Lemma 6.3. IfiER then 

pr(i e ex itR (/)) = y w "® Pr J x \ 

Proof, i G exit^(J) if and only if the ball i is drawn at a moment 
where the set of balls already drawn belongs to /. Therefore we get the 
probability of this event by summing the probability of first arriving 
at X and then drawing i in the next step, over all sets X in □ 

We can also describe this probability inductively in terms of the 
corresponding probability for smaller ideals. If I is an ideal and % G R, 
we let I/i = {X C R : X U {i} G /}. 

Lemma 6.4. If I is an ideal such that G / and R ^ I, and iq G R, 

then 

Pr(t G exit fl (/)) = ^4l5T + — TdT ]C ^(«)^(i G exit K (J/i)). 
W R {K) W R {KJ t—' 

Proof. The first term in the right hand side is the probability that «o 
is the first ball to be drawn. The probability that ball i is the first ball 
to be drawn is wr(i)/wr(R), and given that this is the case, the ball 
i belongs to exit^(J) if and only if it belongs to exit^(J/z). □ 

If / is an ideal, we let T R (I) = inf(t : u R {t) ^ /) denote the exit 
time of /, in other words the time at which the random walk u R leaves 
/, or equivalently the amount of time it spends in I. The following 
formula for the expected value E[Tr(/)] of the exit time follows from 
the observation that the amount of time spent in / is equal to the sum 
of the time spent at each X G /. 

Lemma 6.5. 

E[T s (/)] = £^. 

Proof. Given that the walk reaches X, the expexted amount of time 
until another ball is drawn is equal to 

1 



w R (X) 

□ 

7. A FORMULA FOR THE PARTICIPATION PROBABILITY OF A ROW 

In this section we obtain a connection between the random assign- 
ment problem and the urn model by deriving a formula for the prob- 
ability that a certain row intersects an optimal fc-assignment. The 
special case of matrices without zero entries was proved in [BCR02 . 
Another special case, that of row- and column-weights equal to 1 (rate 
1 exponential variables) was proved in [LW03J by a different method. 
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If M is a nonnegative random matrix, we let Pk{M) be the (random) 
set of rows that intersect some optimal fc-assignment in M. If Z is a 
set of sites, we let r(Z) be the set of rows in the row- maximal optimal 
cover of Z. Moreover, if k is a positive integer, we let Ik{Z) be the 
ideal of all sets of rows which are partial k — 1-covers of Z. 

Lemma 7.1. X G I k {Z) iff X U r{Z) G I k (Z). 

Proof. We prove this by induction on the size of X. The induction 
step is equivalent to proving that the statement holds when X consists 
of one row, say X = {i}. If i G r(Z), then X = X U r(Z), so the 
statement is obvious. If % ^ r(Z), then let Z' be the set of sites in 
Z which are not in row i. By Konig's theorem, rank(Z') = rank(Z). 
Hence an optimal cover of Z, in particular the row-maximal one, is also 
an optimal cover of Z'. □ 

Corollary 7.2. Let Z be a set of sites, and let (i, j) be a site such that 
rank(Z U = rank(Z) + 1. Then I k (Z U = I k (Z)/i. 

Proof. Since % G r(Z U {(z, j)}), this follows from Lemma \7. II □ 

The following theorem establishes the connection between the urn 
process and the random assignment problem. Thereby it forms the 
basis for our approach, and in a sense it is the central theorem in the 
paper. 

Theorem 7.3. Let M be an m by n random matrix indexed by weighted 
sets R and C . Suppose that M has the following properties: There is a 
specified set Z of sites where the entries in M are zero. The remaining 
entries in the rows in r(Z) are positive real numbers. Fori G R, j G C, 
if i r{Z) and ^ Z, then M(i,j) is exp(wn(i)wc(j))- distributed 
and independent of the other matrix entries. Suppose that a certain 
row io has no zeros. Then 

Pr(t G Pk (M)) = Pr(i G exit R (I k (Z))). 

In our applications of this theorem, we are always dealing with stan- 
dard matrices. However, to make the inductive proof go through, we 
must condition on the values of the nonzero entries in the rows in r(Z). 
For this reason we let these entries be fixed numbers instead of random 
variables. 

Proof. We prove this by induction. Let A be the column-maximal op- 
timal cover of Z. Let M' be the reduction of M by A. Then there is at 
least one new zero in M', that is, a site which is not covered by 
A and such that M'(i,j) = 0. We let Z' = : M'(i,j) = 0}. We 

consider two cases. 

(1) All new zeros are in rows that belong to r(Z). In this case 
rank(Z') = rank(Z), and consequently r(Z') = r(Z). It follows im- 
mediately from Lemma 17. II that Ik(Z') = I k {Z). Hence by induction, 
Pr{i G G Pk {M)) = Pr{i G exit R (I k (Z))). 
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(2) There is a new zero M'(i,j) such that i ^ r(Z). Since M(i,j) has 
continuous distribution and is independent of all other matrix entries, 
we may assume that M'(i,j) is the only new zero in M'. Since the site 
is not covered by any optimal cover of Z, we have rank(Z') = 
1 + rank(Z). Hence i G r(Z') = r(Z) U {i}. 

If i — io, then every optimal fc-assignment in M' must intersect row 
io- Since every optimal fc-assignment in M is optimal in M', every 
optimal /c-assignment in M must intersect row io- If on the other hand 
i 7^ io, then with probability 1, row io participates either in all or in 
none of the optimal fc-assignments in M' . If we condition on the values 
of M' in row i, then M' satisfies the criteria of the theorem. Hence 
by induction, Pr(i G pk(M')) = Pr(i G exitjj(/fe(Z'))). By Corollary 
17. 21 we have Ik(Z') = Ik(Z)/i. Therefore if we condition only on being 
in case 2, then 

(8) Pr{io G p k {M)) = 

^7=r + ^i=r E w R (i)Pr(ioEexit R (h(Z)/i)), 
w R {r(Z)) w R {r(Z)) . 0{z) 

which by Lemma is equal to Pr(i G exit R (Ik(Z))). □ 



8. The two-dimensional urn-process 

At this point we introduce a kind of product of two urn processes. 
We consider two independent urn processes on the weighted sets R and 
C respectively. What we here call the two-dimensional urn-process is 
just a piece of notation that makes it easy to state the generalization of 
the Buck-Chan Robbins formula. The weight function is multiplicative: 
If X C R and Y C C, then we let 

w RxC (X,Y) = w R (X)w c (Y). 

Time is two-dimensional, and we let 

URxc(x,y) = (u R (x),u c (y))- 

We further let 

Pr RxC (X,Y) = Pr R (X)Pr c (Y). 

Since the two one-dimensional processes are statistically independent, 
P r Rxc(X, Y) is equal to the probability that there exists a point (x, y) 
in the time plane such that u Rx c(x,y) = (X,Y). 

Let J be an order ideal in 2 R x 2 C . In analogy with the one- 
dimensional exit time, we define the two-dimensional exit time T Rx c(J) 
to be the amount of two-dimensional time spent in J, that is, the area 
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of the region given by u Rx c(x,y) £ J- We have 

(9) E[T RyC (J)]= E ^ffff - 

As indicated in the figure, given that the random process reaches 
(X, Y), the expected amount of time spent there is l/w Rx c{X , Y). 




9. A formula FOR E[min fc (M)] 

Let M be a standard matrix with rows and columns indexed by the 
weighted sets R and C. Let Jk{M) be the ideal consisting of all partial 
k — 1-covers of the zeros of M. In this section we show that the expected 
cost of the minimal fc-assignment in M is simply equal to the expected 
ex it time of Jk{M) in the two-dimensional urn process. When M is 
a matrix, we will write Jfc(M) for Ik[Z), where Z is the set of zeros of 
M. 

Theorem 9.1 (Main Theorem, probabilistic version). 

(10) E[min fc (M)] = E[T RxC (J k (M))]. 

The proof of Theorem 19.11 is inductive. We first prove that (jlOj) is 
consistent with the row participation formula. 

Lemma 9.2. Let M be a standard matrix where row i$ contains no 
zeros. Let M- 7 be obtained from M by setting the entry in position 
(i ,j) equal to zero. If E[mm k (M J )] = E[T RxC ( J k (M 3 ))} for every j , 
then E[min fc (M)] = E[T RxC (J k (M))}. 
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Proof. By Lemma 15.11 the probability that the site (io,j) belongs to 
the optimal fc-assignment in M is 

w R (i )w c {j)(E[Tmn k (M)) - E[wm k (M>)]). 
Therefore, summing over j, 
Wr(i )J2 w gU) (E[min fc (M)] - E[min fc (M J ')]) = Pr{i G G Pk {M)). 

We divide by Wn(io) and use the fact that by Lemma lOl 

Pr{i, G Pk {M)) = Pr(z G exit R (4(M))) = V W ^) P M X ) _ 

xel%\i W ^ 

Hence we obtain 

(11) ^^c(j)(E[min fc (M)]-E[min fc (M^)])= £ 

jec xel k (M)\i w[ * > 

It is clear that we can solve for E[minfc(M)] in (jlljl . To finish the 
induction step, it is therefore sufficient to prove that 
(12) 

If we fix a set X G I k (M)\i , then by Lemma IFT21 applied to the 
ideal {Y : (X, F) G J fe (M)}, we have 

^ ^ wc(j)Prc(Y) 

(X,Y)<=J k (M) (X,Yu{j})<£J k (M) 

Since (io,j) is the only zero of M J in row io, a set of files not con- 
taining row io can be extended to a k — 1-cover of the zeros of M- 7 if 
and only if this can be done while making use of column j. If we want 
to cover the zeros of M- 7 as efficiently as possible, there is no point 
in using row i if instead we can use column j. Therefore the condi- 
tion (X,Y U {j}) ^ Jk{M) on j in the inner sum can be replaced by 
(X,Y) £ J k {M*). _ 

If we multiply by Prii(X)/wR(X) and sum over all X G I&(M)\zo, 
we see that the right hand side of (jUj) equals 

xe^\ i0 y J w R {X)w c {Y) 

(X,Y)EJ k (M) {X ,Y)<jtJ k (Afi) 

Here we can drop the conditions on X, since the inner sum will be 
empty unless X G Ik{N)\i Q . After changing the order of summation 
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so that the sum over j becomes the outer sum, this is equal to 

El .\ P r Rxc(X,Y) 
jec (x,Y)^MM) w ^c(X,Y) 

(X,Y)£J k (Mi) 

By (JHJ), this is equal to the left hand side of (fT2j) . □ 

Secondly, we show that (fTU|) is consistent with removing a column 
that contains at least k zeros. 

Lemma 9.3. Suppose that E[min fc _ 1 (M)] = E[T RxC ( J fc _i(M))] for ev- 
ery standard matrix M , in other words, suppose that ffTUj) holds when 
k is replaced by k — 1 . Let M be a standard matrix that has a column 
with at least k zeros. Then E[min&(.M)] = E\T Rx c(Jk(M))]. 

Proof. Suppose that column jo, has at least k zeros. Let M' be the m 
by n — 1 matrix obtained by deleting the column jo of M. Since every 
k — 1 -assignment in M' can be extended to a fc-assignment in M by 
including a zero in column jo, we have E[min^(M)] = E[minfc_i(M')]. 
To prove the lemma, we therefore show that with the obvious coupling 
of the urn processes, T RxC {Jk{M)) = T RxC \{j }(Jk-i{M')). 

Since there are k zeros in column j , every k — 1-cover of M must 
include j . Therefore (X,Y) G J k {M) if and only if (X,Y\{j }) G 
Jfc_i(M'). The lemma follows. □ 

We are now in a position to prove that fllOl) holds whenever m is 
sufficiently large compared to k. 

Theorem 9.4. If M is a standard m by n matrix with m > (k — l) 2 , 
then E[min fc (M)] = E[T RxC (J k (M))}. 

Proof. By Lemmas 19.21 and 19. 3\ it is sufficient to prove that the state- 
ment holds when M has at least one zero in each row, and no column 
with k or more zeros. In this case each column can contain the left- 
most zero of at most k — 1 rows. Since there are more than (k — l) 2 
rows, there must be at least k columns that contain the leftmost zero 
of some row. This implies that there is a zero cost ^-assignment in 
M. Consequently there is no k — 1-cover, that is, Jfc(M) = 0. Plainly 
E[min fc (M)] = = T RxC {$) = E[T RxC (J k (M))}. □ 

Finally we prove that (|10|) holds also for smaller matrices by taking 
the limit as the weights of the exceeding rows tend to zero. 

Proof of Theorem W. 11 We prove (fTTH) by downwards induction on the 
number of rows. Suppose that M is a standard m by n matrix. Let 
M e be an augmented matrix of m + 1 rows and n columns, so that 
the first m rows equal M, and row m + 1 has no zeros and weight 
w R (m + 1) = e. When e is small, the entries of row m + 1 are large. 
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With high probability, none of them will participate in the optimal 
fc-assignment, and consequently 

E[min fc (M)] = limE[min fc (M £ )]. 

We therefore have to show that 

E[T RxC (J k (M))} = \imE[T RxC (J k (M e ))}. 

Since row m + 1 does not have any zero, we have T Rx c(Jk(M e )) < 
T RxC (Jk(M)) under the obvious coupling of the corresponding urn pro- 
cesses. Hence we can squeeze E[T RxC ( J k (M e ))] by 

(1 - p)E[T RxC (J k (M))] < E[T RxC (J k (M e ))] < E[T RxC (J k (M))], 

where p is the probability that there is a point (x, y) in the time plane 
such that u R (x),uc{y) is a partial k — 1-cover for M but not for M e . 
The only way we can have a partial k — 1 cover for M and not for M e 
is if the row m + 1 has been drawn. However, as e — > 0, the probability 
that row m + 1 is drawn among the first k — 1 rows goes to zero. Hence 
as e tends to zero, so does p. The theorem follows. □ 

10. Proof of the Buck-Chan-Robbins formula 

We prove that the two formulations of the main theorem are indeed 
equivalent. We have 



(13) E 



{x ,Y)eJ k (M) MX)w c (Y) 

£ (-**((*, n i)) • nT R ({x' C X})] ■ E[T c ({r c 

(X,y)GJ fc (Af) 

since the factors 

1 1 



w R (Xy w c (Y) 

can be interpreted as the expected exit times of the ideals {X' C X} 
and {Y' C F} of all subsets of X and Y respectively. By Lemma KT51 



(14) £ -^'^ 



(x,y )e j fc( M) MX)w c (Y) 

E -M(x,ni)E^E^ 

(x,y)eJ fe (M) x'ci W M A J ycy ™c(/ J 

We now change the order of summation and get 

(15) 2^ — 7^ — 2^ -M(^^),i)- 

(X',y)eJ fc (A/) Wi? ( A ^°v y ' {X>,Y>)<{X,Y)<1 
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In the factor to the right, we are summing over all (X, Y) that satisfy 
(X',Y') < (X,Y) < 1 in the poset J k (M). By the definition of the 
Mobius function, this sum is equal to 1, so that we can drop this factor. 
Hence ()15|) is equal to 

V Pr R (X')Pr c (Y') 

{x >,Y>)eMM) W ( X ) W ( Y ) 

This specializes to the equivalence of the two formulations of the 
main conjectures in |LW00j and of the two formulations in [BCR02 . 
The more general setting here allows us to give a shorter proof. 

To finish the proof of Theorem 12.11 we cite a well-known theorem 
(G-C Rota?) that states that the Mobius function of the truncated 
Boolean lattice is indeed given by the binomial coefficient occurring in 
©• 

Theorem 10.1. Let P be a poset consisting of the elements of rank 
0, . . . , k — 1 in a Boolean lattice of degree N, together with an artificial 
top element 1. Then 

M04) = (-i) fc (^r v 

Sketch of proof . Every interval of the form (0,x) for x ^ 1 is Boolean, 
and therefore n(0,x) = (— i) rank E )_ Hence 

fc - x /TUN 

M0, i) = - E (-i) mnkW = -£ (-iy . ■ 

0<x<l i= 

The theorem now follows immediately by induction on k. □ 
11. Specializing to rate 1 variables 

In this section we briefly comment on the implications of Theorem 
19.11 to the case of rate 1 variables. In particular, we show that the 
Coppersmith-Sorkin formula follows. When there are no zeros in the 
matrix, the ideal Jk{M) consists of all sets of at most k — 1 files. 
Moreover, when all files have the same weight, the files become indis- 
tinguishable in the urn process. The expected amount of time until the 
next ball is drawn depends only on the number of balls already drawn, 
and not on which particular balls have been drawn. Therefore we ob- 
tain the expected value of T(J k (M)) by simply conditioning on every 
step in the urn process taking exactly its expected amount of time. 
The expected amount of two-dimensional time that the process spends 
at the point where exactly i rows and j columns have been drawn is 
equal to 

1 

(m — i){n — j) 
From this, the Coppersmith-Sorkin formula follows. 



18 



SVANTE LINUSSON AND JOHAN WASTLUND 



As n — > oo, the process can be approximated by continuous expo- 
nential decay. At time t, a fraction of e~ l of the balls will remain in 
the urn. In the two-dimensional process, the borderline at which the 
process exits the ideal J n (M) will approach the curve given by the 
equation 

e - x + e- y = 1. 

Hence we may obtain the limit value as the area under this curve, which 
is indeed equal to C(2)- 
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